How Well Does Language-based Community Detection Work for Reddit?
نویسندگان
چکیده
Online communities in the modern day era are becoming more and more important. This makes it imperative for us to understand the structure of these communities. In addition, content generation sites like Reddit, Tumblr and Quora have an abundance of text in comments and posts which can be used to model the user interactions and network substructures. In this paper, we propose to study community detection in Reddit solely based on language features, to better understand how well language informs the boundaries between different communities. We use supervised prediction tasks and unsupervised community detection to gauge the quality of these features and find that they provide a fairly robust signal in trying to understand and model user interactions in the network.
منابع مشابه
Talking to the crowd: What do people react to in online discussions?
This paper addresses the question of how language use affects community reaction to comments in online discussion forums, and the relative importance of the message vs. the messenger. A new comment ranking task is proposed based on community annotated karma in Reddit discussions, which controls for topic and timing of comments. Experimental work with discussion threads from six subreddits shows...
متن کاملGiving Gold: Understanding Appreciation in Reddit Communities
In the social media age, our actions are constantly evaluated by other users, many of whom are strangers. While anonymous users are often associated with adversarial behaviors, such as scamming and trolling, they are also capable of positive interactions. Community appreciation, a positive public evaluation of an individual’s content within a community, is found in nearly every social network; ...
متن کاملBetter When It Was Smaller? Changes in Online Community Content and Behavior Following Massive Growth
Online communities have a love-hate relationship with membership growth: new members bring fresh perspectives, but old-timers worry that growth interrupts the community’s social dynamic and lowers content quality. To arbitrate these two theories, we analyze over 45 million comments from 10 Reddit subcommunities following an exogenous shock when each subcommunity was added to the default set for...
متن کاملFormulation of Language Teachers̕ Identity in the Situated Learning of Language Teaching Community of Practice
A community of practice may shape and reshape the identity of members of the community through providing them with situated learning or learning environment. This study, therefore, is to clarify the salient learning-based features of the language teaching community of practice that might formulate the identity of language teachers. To this end, the study examined how learning situations in two ...
متن کاملBringing Classroom-Based Assessment into the EFL classroom
This paper describes how English as a Foreign Language (EFL) teachers can bring reliable, valid, user-friendly assessment into their classrooms, and thus improve the quality of learning that occurs there. Based on the experience of the author as a an EFL teacher and teacher-trainer, it is suggested that the promotion and development of autonomy, intrinsic motivation...
متن کامل